Server-based browser system

ABSTRACT

A server-based browser system provides a server-based browser and a client system browser. The client browser reports the position of a user click on its display screen, hotspot ID, or URL to the server-side browser which retrieves a Document Object Model (DOM) model and view tree for the client and finds the location on the Web page that the user clicked on using the coordinates or hotspot ID received from the client. If there is a script associated with the location, it is executed and the resulting page location is requested from the appropriate server. If there is a URL associated with the location, it is requested from the appropriate server. The response Web page HTML definition is parsed and a DOM tree model is created which is used to create a view tree model. The server-side browser retrieves a style sheet, layout algorithms, and device constraints for the client device and lays out the view model using them onto a virtual page and determines the visual content. Textual and positional information are highly compressed and formatted into a stream and sent to the client browser which decodes the stream and displays the page to the user using the textual and positional information.

CROSS REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No. 12/535,550 entitled “SERVER-BASED BROWSER SYSTEM,” filed on April Aug. 4, 2009, which is a continuation of U.S. Pat. No. 7,587,669 entitled “SERVER-BASED BROWSER SYSTEM,” filed on Apr. 16, 2004, which is a national stage entry of Patent Cooperation Treaty (PCT) Application No. PCT/US02/11420, filed on Apr. 9, 2002, which claims benefit of Provisional App. No. 60/282,194 filed on Apr. 9, 2001, the entirety of which are incorporated by reference herein in their entireties.

BACKGROUND OF THE INVENTION

1. Technical Field

The invention relates to Web browsing technology. More particularly, the invention relates to a system and method for fast delivery of and fast rendering of Web pages on a lightweight network device.

2. Description of the Prior Art

A typical Web page today contains not only HTML content but also, by reference to, other content elements such as graphics and external style sheets. A browser acquires the referenced content elements by spawning worker threads that retrieve the data via multiple HTTP-TCP/IP connections. This model works well for desktop systems containing large amounts of processing power and RAM. However, it is usually not feasible for mobile, lightweight, embedded devices in which processing power and RAM are much limited.

The process for proper HTML parsing, validation, and layout code is normally quite complex and requires large amount of CPU and RAM to execute. Size and manufacturing cost have been major concerns in providing embedded devices with a capacity to implement the process. Although embedded devices are getting more and more processing power and RAM, the requirements of a proper browser engine still outdistance the resources available to even the highest end of embedded devices.

High levels of CPU usage are detrimental to the preservation of battery life, which is a very important consideration for today's mobile devices. Heavy loads in HTTP communications, HTML parsing, HTML validation and layout may largely shorten the battery life.

Bandwidth usage is also a major constraint that must be considered in designing a browser model for embedded devices. Text based HTML content, although compressible, is already quite large with respect to the limited transmission speed of today's wireless networks. When image content designed for higher resolution and larger form factor display is included, one may quickly end up consuming a lot of wasted bandwidth transmitting redundant, not necessarily usable data to mobile devices.

Although various browsing technologies have been already presented in the marketplace, many of them fail to meet the tight constraints and stringent requirements of the embedded devices. Writing a cross platform code base for these devices is an even bigger challenge, given the multitude of operating systems, memory models and processors available for embedded device manufacturers.

It would be advantageous to provide a server-based browser system that offloads heavy weight components of a traditional browser to a back end server. It would further be advantageous to provide a server-based browser system that leaves a lightweight client device with a minimal task of implementing a customizable interface to display a pre-rendered representation of Web pages.

SUMMARY OF THE INVENTION

The invention provides a server-based browser system. The system offloads heavy weight components of a traditional browser to a back end server. In addition, the invention leaves a lightweight client device with a minimal task of implementing a customizable interface to display a pre-rendered representation of Web pages.

A preferred embodiment of the invention provides a server-based browser and a client system browser. The client browser determines the position of a user click on its display screen and sends a navigation request containing the coordinates of the click, or a hotspot ID, to the server-side browser. The client browser will also accept a specific URL from a user and places the URL in the navigation request.

The server-side browser retrieves a Document Object Model (DOM) model and view tree from a storage device for the client. It then finds the location on the Web page that the user clicked on using the coordinates or hotspot ID received from the client. If there is a script associated with the location, it is executed and the resulting page location is requested from the appropriate server. If there is a URL associated with the location, it is requested from the appropriate server.

The response Web page HTML page definition is parsed and a DOM tree model is created. The server-side browser uses the DOM tree and creates a two-dimensional view tree model, and stores both trees on the storage device.

The server-side browser retrieves a style sheet, layout algorithms, and device constraints for the client device from the storage device. It lays out the view model using the style sheet, layout algorithms, and device constraints onto a virtual page.

If the client browser is a subset of the server-side browser engine, then the DOM tree and view tree are compressed into a stream and sent to the client browser which decodes the stream, translates the DOM tree and view tree, and displays the page to the user.

Otherwise, the server-side browser determines the visual content on the virtual page and the location of the visual content. Visual content and positional information are highly compressed and formatted into a stream and sent to the client browser which decodes the stream and displays the page to the user using the visual content and positional information.

The DOM tree and view tree and all state information (including Javascripts and cookies) for the client are stored on the storage device by the server-side browser.

Other aspects and advantages of the invention will become apparent from the following detailed description in combination with the accompanying drawings, illustrating, by way of example, the principles of the invention.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block schematic diagram of a preferred embodiment of the invention showing a server-side browser and a client browser according to the invention;

FIG. 2 is a block schematic diagram of a server-side browser and renderer and client device according to the invention;

FIG. 3 is a block schematic diagram of a server-side browser engine and client device according to the invention;

FIG. 4 is a block schematic diagram showing an HTML document tree according to the invention;

FIG. 5 is a block schematic diagram showing a view tree according to the invention;

FIG. 6 is a block schematic diagram of a view layout of a Web page according to the invention;

FIG. 7 is a block schematic diagram of a flowchart of an embodiment of the invention according to the invention; and

FIG. 8 is a block schematic diagram of a task viewpoint of a preferred embodiment of the invention according to the invention.

DETAILED DESCRIPTION OF THE INVENTION

The invention is embodied in a server-based browser system. A system according to the invention offloads heavy weight components of a traditional browser to a back end server. In addition, the invention provides a server-based browser system that leaves a lightweight client device with a minimal task of implementing a customizable interface to display a pre-rendered representation of Web pages.

A typical desktop browser requires a large amount of processor time to handle the load of obtaining Web page information and parsing the HTML code in the page. Secondary requests within the HTML code also require a large amount of processing time. Simple client devices such as PDAs, cellphones, and low CPU power set top boxes cannot handle the amount of processing power required to perform typical Web browsing.

Additionally, bandwidth on devices such as cellphones are limited and are charged to the user on a byte basis. This means that the large data transfers involved in typical Web browsing are too expensive in terms of bandwidth limitations and cost to the user.

Referring to FIG. 1, the invention provides a server-based 101 browser 104 that converts Web pages to a minimal format adaptable to different types of wired/wireless client devices such as cellphones, PDAs, and set top TV boxes. A light weight browser 105 is provided on a client device 102 to accept highly compressed streams of data from the server-side browser 104. The server-side browser 104 accesses information across the Internet or intranet 103.

With respect to FIG. 2, a user using a client device 206, 207 requests a Web page from a light weight browser or the light weight browser itself requests a Web page. The request is sent from the client 206, 207 to a server-side browser 201. The server-side browser 201 identifies the Web page that the user requested and sends a request to the proper server across the Internet or intranet.

When the requested Web page is returned to the server-side browser 201, the server-side browser 201 loads the initial HTML document. The received HTML document consists of text, which comprises the content of the document, and tags, which define the structure and appearance of the document. The server-side browser 201 looks at any dependencies within the document (e.g., referenced images, Javascript files that need to be loaded remotely, etc.) and opens multiple sockets to get the required information across the Internet or intranet.

The server-side browser 201 gathers the Web page information and lays out the information to calculate how to display the information on a particular device. The server-side browser 201 makes a second pass at the layout information and then transcodes the information into a very tight data stream that can be represented on the client 206, 207. The stream includes positional information that is not present in the normal HTML for the Web page.

The server-side browser 201 takes all of the HTML information for a Web page and represents it as two dimensional content on the server. This two-dimensional representation helps the browser determine the exact position information positional information for all of the pieces of a particular HTML Web page. If there are any images or other dependencies, the server-side browser 201 scales (or crops) the images down and/or preprocesses the images (e.g., reduce the bit depth of an image, dither the image, etc.) to match the capabilities of the client 206, 207. The browser then creates a compressed stream that is sent down to the client. This compressed stream contains text and images with exact positional information (that tells where the pieces of content fit on the page), and hot spot information. The browser maps each rectangular hotspot region to a related URL. Thus, no extraneous information, such as hyperlink URLs, or Cookie Data is sent down to the client.

The server-side browser 205 uses device characteristics information 202 for each supported device type, consisting of the associated default CSS style sheet, a list of supported fonts and their text metrics, device aspect ratios, pixel depth, color palette, form control dimensions, preferred layout algorithms, a DTD specifying level of HTML compliance, JavaScript policies, the User-Agent String to represent to a Web Server, and the compressed stream format encoder, to properly determine the ideal representation of Web content on the device. Various examples of potential clients with vastly different device characteristics could include: a cellphone 206 with a minimal CPU and minimal RAM; a PDA client with a small display but powerful CPU and high RAM; or a TV box 207 with a large display, high capacity CPU, low RAM, and non TCP/IP based communications. The server-side browser 205 utilizes many different layout algorithms to properly render information with these various classes of clients that all have different: display, CPU, RAM, and IO constraints.

A lightweight client browser is provided on the clients 206, 207 that accepts a standard information format from the server-side browser 201 and displays the data streams. This means that the server-side browser 201 can be updated at any time without affecting the clients 206, 207.

Referring to FIGS. 3, 4, and 5, a preferred embodiment of the invention provides a server-side browser engine 302 that accepts HTML (e.g., HTML 4.0 content with support for CSS version 1) pages 303 and parses the HTML 304. The parser 304 creates a Document Object Model (DOM) object hierarchy model 305. The DOM model 305 is a tree representation of the HTML in the Web page, which is well known in the art.

A style sheet 306 tells the server-side browser 301 what fonts to use for standard HTML tags (e.g., bold), border sizes, and other stylistic information for each supported client device. A device information file 307 is also used by the browser that tells the server-side browser 301 the limitations of a client device (e.g., the size of a text control, fonts available, aspect ratio, etc.). The server-side browser 301 discovers what device type the client is when the client signs onto the server.

Using the style sheet 306 and the device information 307 for the particular device, the server-side browser 301 creates a view tree 308 which is a two-dimensional model of the parent-child relationship between viewable objects of the DOM model 305.

A DOM tree is shown in FIG. 4. The HTML 401 is the root of the tree and underneath has a Body tag 402. Under the Body tag 402 is an Anchor tag 403 which has Text 404 underneath. There is also a Table 405 beneath the Anchor tag 403. The Table 405 has a row 406 which has a cell 407. The browser makes views of DOM tree elements that are visible. For example, a Text view 408 is made of the Text 404, a Table view 409 is made of the Table 405, a TR view 410 is made of the row 406, and a TD view 411 is made of the cell 407.

An exemplary view model is shown in FIG. 5, where a Body view 501 is the root of the tree. A Table view 502 sits below the Body view 501 and has a TR view 503 below it. A TD view 504 is below the TR view 503. The TR view 504 has both a text view 505 and an image view 506 below it.

The server-side browser 301 then lays out the view model 308 on a virtual page. Each view in the view model 308 has a rectangular area of information attached to it. The server-side browser 301 lays out the views using specific layout algorithms for each view (e.g., tables, block levels, etc.) and uses the style sheet 306 and device information 307 for the particular device to make sure that any special characteristics or behaviors of the device are taken into account. For example, for a TV box client, there may be a fixed table width defined in the view and fixed table widths are ignored by the TV box, or, one pixel width table borders are only supported on the TV box client. An example of a layout algorithm is every cell in a table is considered a row in itself, for a certain device each cell is to be laid out vertically instead of the normal horizontal layout.

With respect to FIG. 6, the server-side browser performs the first layout pass which lays out all of the views of the page 601. The page 601 is a virtual view of how the page looks. In this example, a body view 605 has a table view 604 within it. The Table view 604 has a row 603 which has a cell with text in it 602. Each view has a rectangular area that it represents and a parent-child relationship. This allows the server-side browser to walk through the viewable area to see what it represents.

A second pass is made to determine what the real content is on the page. HTML authors often define multiply nested tables to constrain a layout of an HTML page. The server-side browser throws away the unnecessary information that does not contribute to the appearance of the page (e.g., not renderable). The server-side browser is concerned with the physical positioning of content elements within the page. For example, the browser finds text 602 on the page 601 and accumulates the text into a string and compresses it to send to the client. The server-side browser also sends the client information that tells the client where the text should be displayed in its viewing area. This information is placed in a binary format that the client device feeds into.

The DOM tree and view tree and all state information (including Javascripts) for the client are preserved on the server. This is to ensure that, when the user is interacting with a page on the client, the server-side browser knows where the user has clicked on the page. The server-side browser does not send URL links to the client. This means that the client only has positional or hotspot information relating to where the user clicked or pressed on the page and sends that coordinate or hotspot information to the server-side browser. The server-side browser looks at the DOM tree and view model to find where the coordinates map onto the page. The view model is used to find the location of the coordinates or hotspot ID and the DOM tree is used to find the URL or script associated with the clicked area. When the server-side browser finds the area that corresponds to location where the user clicked, the server-side browser executes whatever script or URL request that corresponds with the area. The server-side browser then updates the client with the new page if the action was navigational, for example.

Cookie data are impractical to place on the client device. The server-side browser caches all of the user's cookies in a user state that the server-side browser preserves on the server for each user.

A session object exists on both the server and the client. The session protocol is used by the client and server to communicate information. For example, when the client first signs onto the server, it tells the server its configuration information and other information. The server-side browser looks into its database and finds the client device's style and device information.

Referring to FIG. 7, a flowchart of an another preferred embodiment of the invention is shown. The client browser 711 sends a navigation request to the server-side browser engine 701. The browser session 702 retrieves the client specific style sheet 703 and layout algorithms 704.

The browser session 702 fetches the requested Web page from the Internet or intranet. The HTML Web page 705 is parsed and a DOM tree model is created 706. The DOM tree model is then used to create a two-dimensional view tree model 707.

The invention lays out the view tree based on the client specific style sheet and layout algorithms 708. Any additional layouts are performed on the view tree to fit the client device constraints 709. The resulting two-dimensional view tree and DOM tree are encoded in a lightweight format and sent to the client which decodes the format 710 and displays the resulting page to the user through the client browser 711.

With respect to FIG. 8, a task viewpoint of the invention is shown. The Client Browser module 812 determines the position of a user click or press on its display screen and sends a navigation request containing the coordinates of the click or press, or the hotspot ID, to the server-side browser through the Client Session module 811 to the Server Session module 804. The Client Browser module 812 will also accept a specific URL from a user and places the URL in the navigation request. The Server Session module 804 forwards the navigation request to the Find Position module 805. The Find Position module 805 accepts Specific URLs from the client, coordinates of a user's click or press, and hotspot IDs 810.

The Find Position module 805 forwards any specific URL to the Web Page Manager 902. Otherwise, the Find Position module 805 retrieves the DOM model and view tree from the storage device 807 for the particular client. It then finds the location on the Web page that the user clicked or pressed on using the coordinates or hotspot ID received from the client. If there is a script associated with the location, it is executed and the resulting page location is sent to the Web Page Manager 902. If there is a URL associated with the location, it is forwarded to the Web Page Manager 902.

The Web Page Manager 802 receives the URL from the Find Position module 805 and formats a request for the Web page. The request is sent to the Web Session module 803. The Web page is returned to the Web Session module 803 and is forwarded to the Web Page Manager 802.

The Web page Manager 802 forwards the HTML Web page definition to the Parse HTML module 801. The Parse HTML module 801 parses the HTML and creates a DOM tree model which it stores on the storage device 807.

The Web page Manager 802 then activates the Create View module 806. The Create View module 806 uses the DOM model stored on the storage device 807, creates a two-dimensional view tree model, and stores it on the storage device 807.

Once the view tree is completed, the Web page Manager 802 retrieves the style sheet, layout algorithms, and device constraints for the specific client device are retrieved from the storage device 807. The Web page Manager 802 lays out the view model using the style sheet, layout algorithms, and device constraints.

If the Client Browser 812 is a subset of the server-side browser engine, then the DOM tree and view tree are compressed into a stream 809 and sent through the Server Session module 804 to the Client Browser 812 via the Client Session module 811. The Client Browser 812 decodes the stream, translates the DOM tree and view tree, and displays the page to the user.

Otherwise, the Web page Manager 802 performs the first layout pass and lays out all of the views of the page. The Web page Manager 802 then makes a second pass to determine the visual content on the page. Visual content and positional information are highly compressed and formatted into a stream 809 and forwarded to the Server Session module 804. The Server Session module 804 sends the stream to the Client Browser 812 via the Client Session module 811. The Client Browser 812 decodes the stream and displays the page to the user using the visual content and positional information.

The DOM tree and view tree and all state information (including Javascripts and cookies) for the client are stored on the storage device 807 by the Web page Manager 802.

Although the invention is described herein with reference to the preferred embodiment, one skilled in the art will readily appreciate that other applications may be substituted for those set forth herein without departing from the spirit and scope of the present invention. Accordingly, the invention should only be limited by the Claims included below. 

1. (canceled)
 2. A method comprising: receiving a navigation request from a client device, the navigation request including display screen coordinates corresponding to a position on a display of the client device; identifying a script associated with the display screen coordinates; executing the script and requesting an associated page from a corresponding server; receiving the page from the corresponding server; creating a layout of the page by determining a plurality of content elements present in the page based at least in part on device characteristics information associated with the client device, the position of each content element within said page represented by position data comprising two-dimensional coordinates; creating a compressed stream containing the plurality of content elements and the position data; and sending said compressed stream and the layout to the client device.
 3. The method of claim 2, further comprising: identifying any images associated with the page; and scaling the images based at least in part on the device characteristics information, before creating the compressed stream.
 4. The method of claim 2, further comprising: identifying any images associated with the page; and cropping the images based at least in part on the device characteristics information, before creating the compressed stream.
 5. The method of claim 2, further comprising: identifying any images associated with the page; and reducing a bit depth of the image based at least in part on the device characteristics information, before creating the compressed stream.
 6. The method of claim 2, wherein the device characteristics information includes stylistic information, layout algorithms, and format encodings associated with the client device.
 7. The method of claim 2, wherein the client device is at least one of a cell phone or personal digital assistant (PDA) device.
 8. The method of claim 2, wherein the client device is a set top box.
 9. The method of claim 2, further comprising: receiving a sign on request form the client device; in response to receiving the sign on request, identifying a type associated with the client device; and based at least in part on the type, accessing the device characteristics information.
 10. The method of claim 2, wherein creating the compressed stream comprises: determining real content associated with the page; removing nested tables form the page received from the corresponding server; and accumulating text of the page into a string.
 11. A hardware apparatus for hosting a server-based browser that interacts with a plurality of client device types, the interactions comprising: receiving a navigation request from a client device, the navigation request including display screen coordinates corresponding to a position on a display of the client device; requesting a page from a corresponding server based at least in part on the display screen coordinates; receiving the page from the corresponding server; creating a layout of the page by determining a plurality of content elements present in the page based at least in part on device characteristics information associated with the client device, the position of each content element within said page represented by position data comprising two-dimensional coordinates; creating a compressed stream containing the plurality of content elements and the position data; and sending said compressed stream and the layout to the client device.
 12. The apparatus of claim 11, wherein the interaction further comprise: determining real content associated with the page; removing nested tables form the page received from the corresponding server; and accumulating text of the page into a string.
 13. The apparatus of claim 11, wherein the interaction further comprise: receiving a sign on request form the client device; in response to receiving the sign on request, identifying a type associated with the client device; and based at least in part on the type, accessing the device characteristics information.
 14. The apparatus of claim 11, wherein the interaction further comprise: identifying any images associated with the page; and cropping the images based at least in part on the device characteristics information, before creating the compressed stream.
 15. The apparatus of claim 11, wherein the stream includes a Document Object Model (DOM) tree.
 16. One or more non-transitory computer-readable media having computer-executable instructions that, when executed by one or more processors, cause the one or more processors to perform operations comprising: receiving a request from a client device, the request including display screen coordinates corresponding to a position on a display of the client device; identifying a hotspot associated with the display screen coordinates; requesting an associated page from a corresponding server associated with the hotspot; receiving the page from the corresponding server; creating a layout of the page by determining a plurality of content elements present in the page based at least in part on device characteristics information associated with the client device, the position of each content element within said page represented by position data comprising two-dimensional coordinates; creating a compressed stream containing the plurality of content elements and the position data; and sending said compressed stream and the layout to the client device.
 17. The one or more non-transitory computer-readable media of claim 16, wherein the device characteristics information includes stylistic information, layout algorithms, and format encodings associated with the client device.
 18. The one or more computer-readable media as recited in claim 16, having computer-executable instructions that, when executed by the one or more processors, cause the one or more processors to perform operations comprising: parsing the HTML code of the page to create a Document Object Model (DOM) tree; and sending the DOM tree as part of the compressed stream.
 19. The one or more non-transitory computer-readable media of claim 16, wherein the client device is at least one of a cell phone or personal digital assistant (PDA) device.
 20. The one or more non-transitory computer-readable media of claim 16, wherein the client device is a set top box.
 21. The one or more computer-readable media as recited in claim 16, having computer-executable instructions that, when executed by the one or more processors, cause the one or more processors to perform operations comprising: identifying any images associated with the page; and scaling the images based at least in part on the device characteristics information, before creating the compressed stream. 